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td 1 Introduction 

e 

' This paper concerns the normal approximation of random variables living inside a fixed sum 

of Wiener chaoses associated with a Poisson measure over a Borel measure space. Our main 
I> theoretical tools come from the two papers |26| 12 7| . respectively by Peccati et al. and Peccati 

and Zheng, where the normal approximation of functional of Poisson measures is studied by 
combining two probabilistic techniques, namely the Stein's method and the Malliavin calculus of 
variations. 

T-H We shall focus on conditions implying that a given sequence of random variables satisfies a 

central limit theorem (CLT), where the convergence in distribution takes place in the sense of the 
Wasserstein distance (see Section [l.l| for definitions). Our main concern is to provide analytic 
conditions for asymptotic normality, that is, conditions only involving expressions related to the 
•'-j kernels in the chaotic expansion of a given random variable. In particular, our approach does 

rN not involve computations based on the method of moments and cumulants (with the exception 



Cd of Theorem 4.14 where we deal with Poisson approximations). 



The main contributions of our paper are the following: 



In Theorem 3.5 we shall prove that conditions for asymptotic normality can be expressed 



in terms of norms of contraction operators (see Section 2.2 1. These analytic objects already 
appear in CLTs living inside a fixed Wiener chaos (see |261 127|). and are a crucial tool in 
order to effectively assess bounds based on Malliavin operators. One further important 
point is that the use of contraction operators allows one to neatly distinguish the contri- 
bution of each chaotic projection to the CLT, as well as to deduce joint CLTs for these 



projections starting from the asymptotic normality of their sum (see Proposition 3.14). 
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In Theorem 3.12 we shall prove that, when specialized to random variables such that each 
kernel in the Wiener- Ito representation has a constant sign, our results yield necessary and 
sufficient conditions for asymptotic normality. The main tools in order to show such a 



result are two new analytic bounds, stated in Proposition 3.8 and Proposition |3.9[ These 
findings extend to the Poisson framework the 'fourth moment theorem' proved by Nualart 
and Peccati (in a Gaussian setting) in [21], a result that has been the starting point of a 
new line of research in stochastic analysis - see the book [19J, as well as the constantly 
updated webpage 

http : //www. iecn. u — nancy.fr/ nourdin/steinmalliavin.htm. 

As discussed below, random variables having Wiener-Ito kernels with constant sign appear 
quite naturally in problems arising in stochastic geometry. In particular, we shall use 
our results in order to provide an exhaustive characterization of stationary geometric 
random graphs whose edge counting statistics exhibit asymptotic Gaussian fluctuations 



(see Theorem 4.11 ). This family of geometric graphs contains e.g. interval graphs and disk 
graphs - see e.g. |6l [TJ |8l [151 El EH]. Our characterization of geometric random graphs 
involves 'diagonal subsets' of Cartesian products, that are reminiscent of the combinatorial 
conditions for CLTs used by Blei and Janson in pj, in the framework of CLTs for finite 
Rademacher sums (see also [20] Section 6]). As a by-product of our analysis (see Theorem 



4.14 1, we shall illustrate a new form of Poisson convergence for random graphs with sparse 



connections. 



We stress that one of our main motivations comes from a remarkable paper by Reitzner and 
Schulte \31\ . laying the foundations of a general theory for CLTs involving [/-statistics based 
on Poisson point processes. In particular, one of the crucial insights of |31) concerns the use of 
a formula by Last and Penrose (see |12|). providing explicit expressions for Wiener-Ito chaotic 



decompositions in terms of difference operators (see Theorem 2.9). It is interesting to note 
that Last and Penrose's formula is the Poisson analogous of the so-called 'Stroock formula' 
of Malliavin calculus, which is in turn an important tool for proving CLTs involving non-linear 
functionals of Gaussian measures (see e.g. |19|, Corollary 2.7.8] for a discussion of this point). We 
shall see that our findings complement and extend the results proved in [31j in several directions. 
See also Decreusefond et al. |7], Ferraz and Vergne Last et al. tl3j , Minh tl6j , Schulte |33j . 
Schulte and Thaele |34l 135) . for several new findings pertaining to this line of research. 

In order to keep the length of this paper within bounds, in Section 4 we will present appli- 
cations that are related to a very specific setting, namely edge-counting in random geometric 
graphs with possibly large connections. The power and flexibility of the results proved in the 
present work are further illustrated in the companion paper [llj, where the following applications 
are developed in full detail: 

(i) analytic bounds for the normal approximation of [/-statistics based on marked point pro- 
cesses, in particular [/-statistics with rescaled kernels; 

(ii) bounds for general subgraph counting in the disk graph model under any regime; 

(iii) an exhaustive characterization of the asymptotic behavior of geometric [/-statistics; 

(iv) applications to the boolean model, and to subgraph counting in disk graph models with 
random radius. 

The rest of this section is devoted to the formal presentation of the main problems that are 
addressed in this paper. 
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1.1 Poisson measures 



Throughout the paper {Z, iF, /u) is a measure space such that Z is a Borel space, 2f is the 
associated Borel a-field, and is a fi-finite Borel measure with no atoms. We write = {B £ 
Z : //(-B) < 00} to denote the subclass of !^ composed of sets with finite measure. Also, we shall 
write T] = {r]{B) : B € to indicate a Poisson measure on (Z, Z) with control fi. In other 
words, ?7 is a collection of random variables defined on some probability space (O, P), indexed 
by the elements of 5^ and such that: (i) for every B,C £ Z^j^ such that BOC = 0, the random 
variables rj{B) and r/(C) are independent; (ii) for every B € i^, r]{B) has a Poisson distribution 
with mean fi{B). We shall also write fj{B) = r]{B) — fJ,{B), B £ and fj = {fi{B) : B G J^}. 
A random measure verifying property (i) is usually called "completely random" or "independently 
scattered" (see e.g. [23] for a general introduction to these concepts). 

Remark 1.1 As it is customary, by a slight abuse of notation, we shall often write x £ tj in 
order to indicate that the point x £ Z is charged by the random measure ?7(-). 

In this paper, we shall focus on sequences of random variables {Fn : n ^ 1} having a finite 
Wiener-Ito chaotic decomposition, that is, such that 

k 

1=1 

where the symbol Iq. indicates a multiple Wiener-I6 integral of order qi with respect to f}, the 
integer k does not depend on n, and each f-'^^ is a non-zero symmetric kernel from Z*' to M (see 



1.2 



Section 2.1 below for details). We will be specifically concerned with the forthcoming Problem 



Recall that, given random variables U,Y £ L^{P), the Wasserstein distance between the 
law of U and the law of Y is defined as the quantity 

dw{U,Y)= sup \E[f{U)]-E[f{Y)]\, 
/eLip(i) 

where Lip(l) indicates the class of Lipschitz real-valued function with Lipschitz constant ^ 1. 
It is well-known that the topology induced by dw, on the class of probability measures on the 
real line, is strictly stronger than the one induced by the convergence in distribution. 

Problem 1.2 Find analytic conditions on the kernels {f^""^} ensuring that the sequence 

Fn := — " , n ^ 1, 
7Va^(i^' 

converges in distribution, as n — )• 00, to a standard Gaussian random variable N ~ ^(0, 1), in 
the sense of the Wasserstein distance. Determine under which assumptions these conditions are 
also necessary, and find explicit upper bounds for the sequence 

dw{Fn,N), n^l. 



We will deal with Problem 1.2 in Section |3] where it is shown that a convenient solution 



can be deduced by using contraction operators. Among other features, these operators provide a 
neat way to deal with the product of multiple stochastic integral, and virtually replace the use of 
diagram formulae - see e.g. (23]. As anticipated, we will see that, in the specific case of random 



in) 

variables as in l\l.n such that n ' ^ 0, our results lead to necessary and sufficient conditions 



that are analogous to the so-called 'fourth moment theorems' for sequences of multiple integrals 
in a Gaussian setting - see [2T] . 
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Remark 1.3 Problem 1.2 is also explicitly studied in |3H Section 4]. In particular, Theorem 
4.1 in |31] provides bounds in the Wasserstein distance for random variables having a finite 
chaotic decomposition, where the bounds are expressed in terms of expectations of inner prod- 
ucts of multiple integral stochastic processes. On the other hand. Theorem 4.7 in j31J provides 
an analytic bound, involving sums over partitions, for the normal approximation of absolutely 
convergent f7-statistics. Here, we call 'analytic bound' any upper bound only involving de- 
terministic transformations of the kernel determining the [/-statistic, without any additional 
probabilistic component. 



1.2 Random graphs 

As anticipated, we shall now apply our main theoretical results to the study of geometric random 
graphs whose edge-counting statistics satisfy a CLT. The class of geometric random graphs 
considered below allow for long connections, in the sense that the geometric rule used to define 
edges is based on the use of arbitrarily large sets and therefore is not local. It is worth noting 
by now that our setting represents a natural generalization of the so called Gilbert graphs - see 
Example |1.5| below. Also, as explained in Remark |1.8| below, part of the models we consider 
cannot be dealt with by directly using the powerful theory of stabilization (see e.g. [14j). 

Now let the notation introduced in the previous section prevail. In what follows, we shall 
denote by W (as in 'window') a measurable subset of Z such that fJ-(W) < oo. We first introduce 
the notion of a geometric random graph based on the restriction of the Poisson measure r/ to W, 
and on some symmetric set H C Z x Z. 

Definition 1.4 (Random geometric graphs) Let H C Z x Z he such that fJ.'^{H) < oo, H 
is symmetric (that is, for every (x, y) G H, one also has (y, x) G H) and H is non-diagonal (that 
is, H does not contain any pair of the type (x, x)). 

(a) The random geometric graph based on r], W and H is the undirected random graph 

G = Giv,W,H) = iV,£), 

such that: (i) the vertices of G are given by the class V = r]r]W = {x£r]:x£ W}, 
and (ii) a pair {x, y} belongs to the set £ of the edges of G if and only if (x, y) £ H. We 
observe that, since H is non-diagonal, G has no loops, that is: G does not contain any 
edge of the type {x,x}. 

(b) Assume in addition that Z is a vector space. The random geometric graph at Point (a) is 
said to be stationary if there exists a set H G Z such that 

H={{x,y) eZ^:x-yeH}. 

Note that, since H is symmetric, one has necessarily that H = —H; moreover, since H 
has no diagonal components, ^ H. 



Example 1.5 (i) The class of random geometric graphs introduced above generalizes the 
notion of a Gilbert graph, obtained by taking Z equal to some metric space (endowed with 
a distance d) and H = {(xi,X2) G : < d{xi,X2) < (5}, 5 > 0. In this case, the 
random geometric graph G is obtained by connecting two vertices vi,V2 G t] CiW if and 
only if d{vi,V2) < S. See e.g. [29] . 
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(ii) \i Z = and H = |(xi,X2) G : < — a;2||Kd < 5}, then the correspondmg 
geometric random graph is stationary with H = 5(0, (5)\{0}, where B{<d,S) C Z stands 
for the open ball of radius 5 centered at the origin. Graphs of this type are customarily 
called interval graphs when Z = M, and disk graphs when Z = - see e.g. [SI El El [151 E] 
for recent developments on the subject. 

In Section [l] we shall use our general results in order to deal with the following problem. 

Problem 1.6 Fix a set W C Z, as well as a symmetric function g : W x W ^ M. Consider the 
following objects: (i) a collection of Poisson measures rjx, X > 0, with cr-finite and non atomic 
control measures fj,x such that fix(W) < oo and ^xiW) t oo, as A — )• oo, and (ii) a collection 
of non-diagonal non-negligible symmetric sets Hx <Z Z x Z , \ > 0. Characterize those classes 
{fJ-x, Hx : A > 0} such that the random variables 

F{g,W;rjx,Hx)= Yl 9ix,y), A > 0, (1.2) 

{x,y}&£x 

(where Gx = iVx,£x) is the geometric random graph based on r]x, W and Hx) verify the limit 
relation 

y/YaT{F{g,W;r]x,Hx)) 

as A — )• oo. 



Remark 1.7 (i) At this stage, the role of the window W might seem immaterial, and indeed 



the substance of Problem 1.6 does not change if one takes W = Z. However, the above 
formulation allows for the more general case of a window W = Wx possibly depending on 
A. Moving windows of this type appear in Section 4, as well as in the paper [11]. 

In many examples and applications, one considers sets Hx such that a{Hxr](W x W)) ^ 0, 
as A — )• oo, for some fixed measure a on W x W. Heuristically, the fact that fj,x{W) t oo 
and a{Hx D (W x W)) I ensures that the following phenomenon takes place: as A grows, 
more and more vertices and edges are added to the geometric graph, whereas old edges are 
deleted as a consequence of the asymptotic negligibility of HxCiiW x W). Solving Problem 
|1.6| in this framework is equivalent to characterizing all sequences of random geometric 
graphs such that the addition of vertices and the cancellation of edges compensate, thus 
generating asymptotic Gaussian fluctuations. 



When specialized to the case of Gilbert graphs on Z = M , Problem 1.6 is tackled in the 



classic reference \29\ Chapter 3] as a special case of general subgraph counting. A comparison 



with the results of |29| Chapter 3] is provided in Section 4.3.1 below. A complete solution of 



Problem 1.6 for general subgraph counting in Gilbert graphs, based on the techniques developed 



in this paper, is presented in Section 3]. See also \31\ Section 6.2]. 



Remark 1.8 Assume that, for every x € r], there exists a random radius such that all the 
y connected to x in the random graph lie in the ball with center x and radius Rx- Then, the 
variable F = F{l,W;r]x, Hx) in (1.2) is stabilizing, meaning that F can be written in the form 



Y(ix,v), 
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where ^ is such that ^(x, r/) is not modified by adding or removing a finite number of points to 
r] outside the ball with center x and radius Rx (see |14| for more details on this topic). In our 



case, to fit the framework of formula (1.2) in the case g = 1, ^{x,r]) should be defined as 



1 

C{x,r]) = -#({y / X : {x,y} £ £x}), 

where indicates the cardinality of A. The CLTs presented for instance in [U [30] cover 
well this case. Remark that in this particular framework of a deterministic connection rule, 
stabilization theory only allows for a bounded length, while we consider here models where 
points can have arbitrarily long connections. 

The rest of the paper is organized as follows. In Section [2| we discuss several background 
results concerning Poisson measures, Wiener chaos and {/-statistics. Section [3] contains our main 
abstract results concerning the normal approximation of random variables having a finite chaotic 
decomposition. Section |4] focuses on random graphs and on several analytical characterizations 
of associated CLTs. An Appendix (see Section [5]) provides some basic definitions and results of 
Malliavin calculus. 



2 Preparation 

2.1 Multiple integrals and chaos 

As before, (Z, 3f, /x) is a non-atomic Borel measure space, and 77 is a Poisson measure on Z with 
control fi. 

Remark 2.1 By virtue of the assumptions on the space (Z, 3f,fi), and to simplify the discus- 
sion, we will assume throughout the paper that ($7, P) and rj are such that 

= jo; = Y2^6^^,n G NU {oo},2;j G z| , 

where 5z denotes the Dirac mass at z, and rj is defined as the canonical mapping 

Also, the cr-field ^ will be always supposed to be the P-completion of the cr-field generated by 
rj. 

Throughout the paper, for p G [1, 00), the symbol L^{n) is shorthand for U'{Z, 3^, fi). For an 
integer q ^ 2, we shall write L^^u"^) := L/'(Z'^, ^'^), whereas L^{fi'^) stands for the subspace 

of LP(/i'^) composed of functions that are /u'^-almost everywhere symmetric. Also, we adopt the 
convention LP{n) = I^s{fJ.) = IJ'{ii^) = L^{ij}) and use the following standard notation: for 
every 1 and every f,g € Lp'^pfl)^ 

{f^9)L^{l,1) = j^Jizi, Zg)g{zi, Zq)fl'^{dzi, ...,dZq), ||/||l2(^,) = {f,f)]!2(^,y 

For every / G L^(//^), we denote by / the canonical symmetrization of /, that is, 

1 v-^ 

f{xi,...,Xq) = — 2^/(x„(i),...,X^(g)), 
^' a 

where a runs over the q\ permutations of the set {1, . . . , q}. Note that 11/11^2(^9) ^ ll/llL2(^q) (to 
see this, use for instance the triangular inequality) . 
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Definition 2.2 For every deterministic function h G L?'{^), we write 



= f]{h) = / h{z)f]{dz 



z 



to indicate the Wiener-Ito integral of h with respect to f). For every q ^ 2 and every / G L'^^fj,'^), 
we denote by the multiple Wiener-Ito integral, of order of / with respect to r}. We also 

set Iq{f) = Iqif), for every / G Lp'{^'^) (not necessarily symmetric), and /o(^) = ^ foi' every real 
constant b. 



The reader is referred for instance to the monograph |24| . by Peccati and Taqqu, for a com- 
plete discussion of multiple Wiener-Ito integrals and their properties (including the forthcoming 



Proposition 2.3 and Proposition 2.4). 



Proposition 2.3 The following equalities hold for every q,m^ 1, every f G Ll^^jfl) and every 
1. E[Iq{f)] = 0, 

2- E[Iq{f)Im{g)] = ql{f,g)LHpi)'^(g=m) (isometric property) . 

The Hilbert space composed of the random variables with the form Iq{f), where q ^ 1 and 
/ G Ll{pfl), is called the qth. Wiener chaos associated with the Poisson measure rj. The following 
well-known chaotic representation property is an essential feature of Poisson random measures. 
Recall that ^ is assumed to be generated by rj. 

Proposition 2.4 (Wiener-Ito chaotic decomposition) Every random variable 

F G L^{n,^,P) := L^{P) 
admits a ( unique ) chaotic decomposition of the type 

oo 

F = E[F] + Y,I^{f^), (2.4) 

1=1 

where the series converges in L^{P) and, for each 1, the kernel fi is an element of Ll{fi''). 



2.2 Star contractions and multiplication formulae 

We shall now introduce contraction operators, and succinctly discuss some of their properties. 
As anticipated in the Introduction, these objects are at the core of our main results. 

The kernel f -k^ g on Z^~^'^~^~\ associated with functions / G L1{^^) and g G Ll{ijfl), where 
p,q'^ 1, r = 1, . . . ,p /\q and / = 1, . . . , r, is defined as follows: 

/*r ^(TI^ • • -^Ir-hti, , . . .,tp-r,Si, , . . . , Sq-r) (2.5) 
fJ,'-{dzi, ...,dzi)f{zi, , . . . ,Z/,7i,. . .,-fr-hti, , . . . ,tp_r) 

Xg{zi, , . . . ,Zi,Ji, . . .,Jr-l,Sl, , . . ■,Sq-r). 

Roughly speaking, the star operator ' ' reduces the number of variables in the tensor product 
of / and g from p + q to p + q — r — I: this operation is realized by first identifying r variables in / 
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and g, and then by integrating out I among them. To deal with the case I = for r = 0, . . . ,pAq, 
we set 

/ *r dilli • • • 1 Trj tl, , ■ ■ ■ , tp—r, Si, , . . . , Sq—r) 
= fill, • ■ ■ , 7r, , • • • , tp-r)g{jl, . . . ,^r,Sl, , . . . , Sq-r), 

and 

/*0 ^(^1, ,tp,Sl, , . . . ,Sq) = f g{ti, ,...,tp,Sl,,...,Sq)= /(tl, , . . .,tp)g{si, , . . . , Sq). 

By using the Cauchy-Schwarz inequahty, one sees immediately that f -k^ g \s square-integrable 
for any choice of r = 0, . . . ,p A 5 , and every / E Ll{^^), g S L'Kfi'^). 



Remark 2.5 For every 1 ^ p ^ q and every r = 1, ...,p, 

[ (/ 4 <?)2d/x*'+«-'- = / (/ f){g g)df,^, (2.6) 

for every / G Lg(/uP) and every g G L^(/x'^) 

The next statement contains an important product formula for Poisson multiple integrals 
(see e.g. P^j for a proof). 

Proposition 2.6 (Product formula) Lei / G L'liiJ') and g G L^(/i'^), p, g ^ 1, and suppose 
moreover that f -k\. g ^ L^(^P'^'^^'''~') /or every r = 1, . . . ,p A q and I = 1, . . . ,r such that I ^ r. 
Then, 

ip{f)m = E ( ^ ) ( ' ) E ( 1) Wr-. (74^) , (2.7) 

i/ie tz/de ~ indicating a symmetrization, that is, 

f -k\. g(x\, . . . , Xpj^q—r—l) ~i ~ Try ^ ^ / *r 5'('^o-(l) ) • • • ) Xcr{p+q—r—l)), 

[p + q — r — L). ^ 

where a runs over all (p + q — r — /)! permutations of the set {1, ...,p + q — r — I}. 



2.3 About the Malliavin formalism 

For the rest of the paper, we shall use definitions and results related to Malliavin-type operators 
defined on the space of functionals of the Poisson measure r]. Our formalism coincides with the 
one introduced by Nualart and Vives in |22) . In particular, we shall denote by D, 6, L and L~^, 
respectively, the Malliavin derivative, the divergence operator, the Ornstein- Uhlenbeck generator 
and its pseudo-inverse. The domains of D, 6 and L are written douiD, domJ and domL. The 
domain of is given by the subclass of L'^{P) composed of centered random variables. For the 
convenience of the reader we have collected some crucial definitions and results in the Appendix 
(see Section [5|. Here, we just recall that, since the underlying probability space Q is assumed 
to be the collection of discrete measures described in Remark |2.1[ then one can meaningfully 
define the random variable u 1— )• Fz{uj) = F[oj + 5z), a; G 0, for every given random variable F 
and every z ^ Z, where 6z is the Dirac mass at z. One can therefore prove that the following 
neat representation of D as a difference operator is in order. 

Lemma 2.7 For each F G domD, 

D^F = Fz- F, a.e.-fi{dz). 



A complete proof of Lemma 2.7 can be found in j22) . 
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2.4 ^/-statistics 



Following |3H Section 3.1], we now introduce the concept of a ?7-statistic associated with the 
Poisson measure rj. 

Definition 2.8 ([/-statistics) Fix k ^ 1. A random variable F is called a U-statistic of order 
k, based on the Poisson measure rj, if there exists a kernel / G L].{ij}') such that 

F= fixi,...,Xk), (2.8) 

(xi,...,Xfe)e?7^ 

where the symbol r]^ indicates the class of all /c-dimensional vectors (xi, x^) such that Xi E rj 
and Xi 7^ xj for every 1 ^ i ^ j ^ k. As made clear in |3H Definition 3.1], the possibly 



infinite sum appearing in (2.8) must be regarded as the L^{P) limit of objects of the type 
^ixi,...,Xk)€ri^nAn /(^ii •••> ^fe)' ^ 1) where the sets An G are such that fi^{An) < oo and 
An t Z^, as n — ^ CO. 



Plainly, a [/-statistic of order one is just a linear functional of rj, with the form 



for some / G L^{fi). The following statement, based on the results proved by Reitzner and 
Schulte in |31) . collects two crucial properties of [/-statistics. 



Theorem 2.9 (See [31j) Let F G L^{P) be a U-statistic as in (^2.8|. Then, the following two 
properties hold. 

(a) The expectation of F is given by 

E[F]= [ f{zu...,Zk)fiHdzi,...,dzk). (2.9) 



(b) If F is also square-integrable, then necessarily f G Lg{fi ), and the Wiener-Ito represen 



tation (2.4) of F is such that fi = 0, for i ^ k + 1, and 

/i(xi, ..,Xi) = ( ^ ) / f{xi,...,Xi,zi,...,Zk-i) fJ.''~'{dzi,...,dzk-i) 



(2.10) 



for every i = l,...,k. In particular, fk = f- For each i = l,...,k, one has that fi G 

L,(M0nL2(^^). 



One should note that formula (2.10) follows from an application of the results proved by 
Last and Penrose in |[12j. 



2.5 ^/-statistics and random graphs 

In this paper, we will be interested in characterizing the Gaussian fluctuations of [/-statistics 
having a specific support. In particular this allows one to deal with the set of 'local [/-statistics" 
introduced by Reitzner and Schulte in |3H Section 6]. Recall that a set H £ Z^ is called 
symmetric if the following implication holds: if (xi,...,Xjt) G H, then (Xo-(i'), x^(^)) G H for 
every permutation a of {1, ...,k}. 
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Definition 2.10 (Support of a ?7-statistic) Let k ^ 2, and let H <Z he a. measurable 



symmetric set. A [/-statistic F as in (2.8) is said to have support in H if the function / is such 
that 

= 0, V(xi, ...,Xfc) ^ 

Example 2.11 (Local [/-statistics) Let Z he & metric space. Then, the class of local U- 
statistics, as defined in |3H Section 6], coincides with the family of [/-statistics having support 
in a set of the type H = {(xi, ...,Xfc) : diam({xi, ...,Xfc}) < 5} for some 6 > 0. Here, the symbol 
diam(S) is shorthand for the diameter of B. 

We shall now point out a well-known connection between [/-statistics and hypergraphs. 
Recall that a hypergraph of order A; ^ 2 is a pair {V, £), where V = {vi, ...,Vm) is a set of vertices, 
and £ = {Ei, Es) is a collection of (possibly non-disjoint) subsets of V (called edges), such 
that each Ei contains exactly k elements; in particular a hypergraph of order 2 is an undirected 
graph. 

Remark 2.12 ([/-statistics as graph statistics) (i) Let k ^ 2, let F be a [/-statistic as 



in (2.8), and assume that / = Ij^fe x Ijj, where W G Z is some set (usually called a 
'window') such that fJ,{W) < oo. Then, the random variable ^F counts the number of 
edges in the random hypergraph {V,£), obtained as follows: V = rjCi W, and the class of 
edges £ is composed of all subsets {xi, ...,Xfc} C V such that (xi, ...,Xfc) S H. 

If fc = 2, Z is some metric space (endowed with a distance d) and H = {(xi,X2) € Z"^ : 
d{xi,X2) < 5} and / = Iw x 1h, then the random variable ^F counts the number of 
edges in the undirected graph whose vertices V are given by the points of W charged by rj 
and such that two vertices vi,V2 are connected by an edge if and only if < d{vi,V2) < S. 
These are the 'Gilbert random graphs' discussed in Example iLSfi). 



To conclude, we present the notion of a stationary [/-statistic. It will play an important role 
in Section m 

Definition 2.13 (Stationary [/-statistics) Fix k ^ 2, assume that Z is a vector space, and 



let F be a [/-statistic of the type (2.8), having support in a symmetric set H). We shall say 
that F is stationary if there exists H C Z^~^ such that 

H = {(xi, ...,Xfc) : (x2 - xi,X3 - xi, ...,Xk - xi) G H}. (2.11) 

Example 2.14 Consider the case k = 2, Z = M.'^ and H = {(xi,X2) G Z^ : ||xi — X2||Kd < 5}. 
Then, the corresponding [/-statistic F is stationary, with H = B{0, 6), where B{0, 6) C Z stands 
for the open ball of radius 6 centered at the origin. See Example |1.5[ii). 



3 Normal approximations for finite chaotic expansions 
3.1 Framework 

We shall tackle Problem |1.2[ by focussing on the normal approximation of random variables F 
having the form 

k 

F = E[F] + Y.Wi), (3.12) 

1=1 

where: 
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/c ^ 1 is an integer; 

the integers qi, i = 1, ...,k, are such that 1 ^ qi < q2 < ■ ■ ■ < Qk'^ 

the symbol Iq indicates a multiple Wiener- Ito integral of order q, with respect to a centered 
Poisson measure ij = r] — fi, where 77 is a Poisson measure on the Borel measurable space 
{Z, with deterministic and a-Hnite control measure ^; 

each kernel is a nonzero element of L'^{ij,'^'), and the class {fi : i = 1, ...,k} verifies in 
addition the forthcoming Assumption |3.1[ 



Assumption 3.1 (Technical assumptions on integrands) Let the notation of Section 2.2 



prevail. Every random variable of the type (3.12) considered in the sequel of this paper is such 
that the following properties (i)-(iii) are verified. 

(i) For every i = 1, d and every r = 1, ...q^, the kernel fi ^q-"*" fi is an element of Lp'{fi''). 



(ii) For every i such that qi ^ 2, every contraction of the type {zi, Z2qi-r-i) '-^ \fi\ *r 
\fi\{zi,...,Z2qi-r-i) is well-defiued and finite for every r = l,...,qi, every / = l,...,r and 
every (zi, Z2q^-r^i) 



(iii) For every i,j = 1, ...,d such that max{qi,qj) > 1, for every k = \qi — qj\\/ 1, ...,qi + qj — 2 
and every (r, /) verifying k = qi + qj — 2 — r — I, 



fi{dz) < 00, 



where, for every fixed z € Z, the symbol fi{z,-) denotes the mapping (zi, Zq-i) 1— )• 

fi{z,Zl, ...,Zq^i). 



Remark 3.2 According to [ 271 Lemma 2.9 and Remark 2.10], Point (i) in Assumption 3.1 
implies that the following properties (a)-(c) are verified: 

(a) for every 1 ^ i < j ^ k, for every r = 1, ...,qi A qj and every / = 1, ...,r, the contraction 
fi fj is a well-defined element of 

(b) for every I ^ i ^ j ^ k and every r = 1, ...,qi, fi -k^. fj is an element of L^(/i'''"'"*~^); 

(c) for every i = 1, ...,k, for every r = 1, ...,qi, and every I = 1, ...,r A {qi — 1), the kernel 
fi fi is a well-defined element of L^(/x^'''~''~'). 



In particular, the multiplication formula (2.7) implies that every random variable F verifying 
Assumption 3.1 is such that Iqi{fi)'^ € -^^(-P) for every i = 1, k, yielding in turn that < 



00. Following a similar route, one can also show that, under Assumption |3.1[ the random variable 
{DF^ — DL~^F)^2(^) is square-integrable (and not merely an element of L^{P)). 



Remark 3.3 Assumption 3.1 imply that Assumptions A-B-C in [27J are verified, so that the 
computations therein can be directly applied in our framework. 



Remark 3.4 For instance. Assumption 3.1 is verified whenever each fi is a bounded function 
with support in a rectangle of the type B x ■ ■ ■ x B, where n{B) < 00. 
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3.2 A general bound 



Let F be a random variable as in (3.12) such that E[F^] = > {a > 0) and E[F] = m e M, 
and consider a Gaussian random variable ~ ^{m,a'^) with the same mean and variance. 
Then, a slight modification of \26\ Theorem 3.1] (the modification resides in the fact that we 
consider an arbitrary variance u^) yields the following estimates: 



dw{F,N) ^ Bi{F;a) ^ B2iF;a), 



where 



BiiF;a) 



B2{F-a) 



-^E[\a'' - {DF,-DL'^F)l2(^^-^\] 
1 



+ - 



E[{D,Ff\D,L-^FMdz), 



1 



a 



E[{a^-{DF,-DL-^F)L.^^f 



(3.13) 



(3.14) 



(3.15) 



+ \ I E[{D,Ff\D,L~^FMdz). 

Jz 



The next statement shows that B2{F; cj^) can be further bounded in terms of the contractions 
introduced in Section 12.21 



Theorem 3.5 Let F and N be the random variables appearing in (3.13)-(3.15). Then, there 
exists a universal constant Cq = Co{qi, qt) G (0,oo), depending uniquely on qi,...,qk, such 
that 



B2{F;a) ^ Co x ^3(^:^7), 



(3.16) 



where 



Bs{F;a) 

= ^ Imax Wfi -kl /i||^2(^2,,-r-!) + max \\fi fj\ 



(3.17) 



In the previous expression, max ranges over all 1 ^ i ^ k such that qi > 1, and all pairs {r,l) 
such that r £ {1, qi} and 1 ^ / ^ r A (g^ — 1), whereas max ranges over all 1 ^ i < j ^ d and 



all pairs {r,l) such that r E {l,...,qi} and I E {l,...,r}. When qi = 1, one can replace 



in the previous bound by the smaller quantity 

ll/l||L2(/i) 



(3.18) 



Proof of Theorem 3.5 Without loss of generality, we can assume that m = 0. Also, throughout 
this proof, we write Fi = Iq^ifi) and = E[Iq-{fi)'^] = gi!||/j||^2(^<j,), i = 1, ■■■,k, in such a way 

that = E[F^] = Y.ti E[F^] = Eti Now write 



E [{a^ - {DF, -DL-iF)i2(^))2] ^ ^ U,>iJe [{af - {DF^, -DL-^Fi) ^2^^^) 



i=l 



+ ^E[{{DF,,-DL-^F,)l2^^)Y], 
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so that one can directly apply |27| Proposition 5.5] and deduce that there exists a constant 
a = a(gi, ■.■,qk) such that 



'E [{a^-{DF,-DL-^F)L2^^)r] 
^ a|max||/i*J, /j||^2(^2,,-r-i) + max 4 /jlli2(^'?»+'jj-'-')} • 
To conclude, observe that 

^ E[{D,Ff\D,L-'FMdz) ^ J^E (j^ \DzFi\^ (j^ \D,L-^F^\^ 



so that |27l Proposition 5.6] implies that there exists a constant b = b{qi, ■■■,qk) such that 

I^E[{D,Ff\D,L-'FMdz) ^ 6a|max||/i4/i||^2(^2,,-.-.) + .maxj|/i||24(^,,)| . 

Taking Cq = a + b yields the desired conclusion. The last assertion in the statement comes from 
the fact that, when qi = 1, fr, E[{D^Fy\D^L-'^F\]fi{dz) = ||/i||i3(^). ■ 

Remark 3.6 According to |27] Lemma 2.9], for every quadruple {i,j,r,l) entering the expres- 
sion of max in (3.17), the following estimate holds: 

ll/j *r /ill^2(^<?i+'Jj-r--!^ ^ Wfi *i fi\\L'2(fj.'2H-r-l) X 1 1 /j *r /?' 1 1 ^2 (^29^ -r-! j . (3.19) 



3.3 Estimates for positive kernels 



We shall now specialize Theorem 3.5 to the case on random variables having the form (3.12 ) and 



such that fi ^ 0. In particular, we shall unveil some useful connections between the quantity 
B^^F; a) and the fourth cumulant of F. 

Remark 3.7 Random variables admitting a Wiener- Ito chaotic expansion with positive kernels 



appear rather naturally in stochastic geometry. For instance, an application of (2.10) shows that 



any [/-statistic with a positive kernel admits a Wiener-Ito chaotic expansion of this type. Note 
that many counting statistics have the form of [/-statistics with an integer-valued (and therefore 
nonnegative) kernel - such as for instance the subgraph-counting statistics in random geometric 
graphs (see e.g. |29| Chapter 3] and the references therein), or the statistics associated with 
hyperplane tessellations considered in 



The following statement concerns random variables of the form (3.12), in the special case 



where E[F] =0, k = 1 and the multiple stochastic integral has a nonnegative kernel. 



Proposition 3.8 (Fourth moment bound, I) Consider a random variable F as in (3.12), 

CT^ > 0. Then, 



with the special form F = Iq{f), q^ ^, where / ^ and E[F ] = ql 



B3iF;a) = ^[m^x\\f4fh, 



(^29-r-!) 



l^ + 



and there exist universal constants c\ = ci(q) <C\= Ci{q), depending uniquely on q, such that 



ci X B^{F; a) ^ .JE[F^] - 3^4 ^ Ci x B^{F- a). 



(3.20) 
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Proof. Using the multiplication formula (2.7), together with (2.6) and the positivity assumptions 
on /, one sees that F"^ can be written in the following form 

= + hgif 4 f) + R, 
where i? is a random variable orthogonal to the constants and to /2g(/*o /) such that 



J (max 11/*^ /II 



L 1 



r J llL2(^2q-r-i) 



+ 



(^2<J-r-i) 



for some universal positive constants J, K depending uniquely on q. The conclusion is obtained 
by using the relation 



where we have used |24| formula (11.6.30)]. 



The following general bound deals with random variables of the form (3.12 ) and with positive 
kernels. 



Proposition 3.9 (Fourth moment bound, II) Let F be as in (3.12), with k ^ 1, and as- 
sume moreover that E[F] = 0, and E[F'^] = o"^ > 0, and ^ for every i. Then, there exists 
a universal constant C2 = C2{qi, ■■■,qk), depending uniquely on such that 

jmax II/, 4 /,||^,(^.,,_._.) + ^max^ ||/dli4(^,.) } ^ C2^/E[F^]^3^. (3.21) 

Proof Write as before Fi = Iq.ifi), and = E[F^], i = I, k. We can now write 

k 

EiF^'j-Sa' = ^{E[F,'] - 3af} 

i=l 

+ 6 Yl {E[F^Ff]-afa]}+ Yl E[F,^F,,F,^F,^] 
:= W + Y + Z, 

where stands for the collection of those {11,12, is, i 4) S {Ij---)^}^) such that one of the 
following conditions is satisfied: (a) ii ^ 12 = is = ^4, (b) ii ^ 12 = is ^4 ^^'^ ^4 iii 



(c) the elements of (ii,i25^3;U) are all distinct. Applying the multiplication formula (2.7) and 
exploiting the fact that each fi is nonnegative, we immediately deduce that Y ^ and Z ^ 0, 



so that the desired conclusion follows from Proposition 3.8 



3.4 Conditions for asymptotic Gaussianity 



This section contains a general statement (Theorem |3.12[ ) about the normal approximation 
of random variables admitting a finite chaotic decomposition. The first part of such a result 
provides sufficient conditions for Central Limit Theorems, that are directly based on Theorem 
3.5| As indicated in the subsequent parts, these conditions become necessary whenever the 



involved kernels are nonnegative. Theorem 3.12 is one of the main results of the paper, and is 
the main tool used to deduce the CLTs appearing in Section 4 and in [Tl]. 
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More precisely, in what follows we shall fix integers k ^ 1 and 1 ^ qi < q2 < ■■■ < Qk (not 
depending on n), and consider a sequence {F^") : n ^ 1} of random variables with the form 



(3.22) 



i=l 



each verifying the same assumptions as the random variable F appearing in (3.12 ) (in particular, 
Assumption |3.1| is satisfied for each n). We also use the following additional notation: (i) 



c(")^ 



l,...,d, and (iii) af{n) 



Em 



H\2] 



Remark 3.10 One of the main achievements of the forthcoming Theorem 3.12 is the 'fourth 
moment theorem' appearing at Point 3 in the statement, which only holds for random variables 
such that the kernels in the chaotic decomposition are nonnegative. As first proved in [21j (see 
also |19| Chapter 5]) an analogous result holds for general sequences of multiple Wiener-Ito 
integrals with respect to a Gaussian process. In particular, in the Gaussian framework one does 
not need to assume that the integrands have a constant sign. Proving analogous statements 
in a Poisson setting is quite a demanding task, one of main reasons being the rather intricate 
multiplication formula ( |2.7| ). Some previous partial findings in the Poisson case can be found 
in Peccati and Taqqu [23j (for sequences of double integrals) and in Peccati and Zheng [28j (for 
sequences of multiple integrals having the form of homogeneous sums). 



Remark 3.11 In the statement of Theorem 3.12 we implicitly allow that the underlying Pois- 
son measure r/ also changes with n. In particular, one can assume that the associated control 
measure fi = explicitly depends on n. This general framework is needed for the geometric 
applications developed in Section [4] 



Theorem 3.12 Let {F^")} be a sequence of random variables as in (3.22), and assume that 
there exists cj^ > such that lim„_s>oo cr'^{n) = a"^ . Let N ~ ^(0, cr^). 



1. For every n, one has the estimate 



dw{F^''\N) ^ Co X i?3(FW;a(n)) + \a\n) 

cr[nj V fj 



a 



In particular, if B^{F^'^^;a{n)) 0, as n oo, then dvy(F("), A^) 
N. 



pin) 



(3.23) 

and therefore 



(n) 

2. Assume that ff' ^ for every i, n. Then, a sufficient condition in order to have that 
B^{F^'^);a{n)) ^ ^s that ^[(F^"))"^] - 3f7^(n) ^ 0. 

3. Assume that /•'•"^ ^ for every i,n, and also that the sequence (F*^"))^, n 1, is uni- 
formly integrable. Then, the following conditions (a)-(c) are equivalent, as n ^ oo; (a) 
dwiF^^'^N) 0, (b) S3(F(");cj(n)) ^ 0, and (c) ^[(FW)^] - 3c7^(n) ^ 0. 

Proof. 1. Let Nn ~ ^(0,cr^(n)), n ^ 1. Then, one has that (see e.g. |19, Proposition 3.6.1]) 
dw{Nn,N) ^ \a\n)-a\ 



so that the desired conclusion follows from Theorem 3.5 as well as the inequality 

dw.(F("),Ar) ^ dw{F'^^''\Nn)+dw{Nn,N). 
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2. If - 3cj'^(n) 0, then relation ( |3.2l| implies that 

^„^||f(n) ,/ f(™)|| I llf(")||2 

max II/. ★^Z- llL2(^2,,-r-i) + .max^ II/. ||i4(^,,) 



0. 



Using (3.19), we see that the last relation implies that B3(F^"'^;a{n)) — )• 0, so that the desired 
conclusion follows from Point 1 in the statement. 

3. In view of Point 1 and Point 2 in the statement, we shall only prove that (a) =^ (c). To 

prove this implication, just observe that F^"'^ N and {(F^"))'*} is uniformly integrable, 
then necessar ily E[N'^] = 3a^, so that the conclusion follows fr om the fact that 

o-^(n) —?- a^. ■ 

Remark 3.13 A sufficient condition in order to have that the sequence {(F^"-*)^} is uniformly 
integrable is the following: there exists some e > such that 



sup 



S|f("¥+^ < oo. 



We shall use some estimates taken from |27| (see, in particular. Table 2, p. 1505, therein). 
Given a thrice differentiable function : M'^ — J- 



|(/?"||oo = max sup 



we set 

^2 



I '/'"'I loo = max sup 



Q3 



<p{x) 



dxi^dxi^dx 



-fix] 



Proposition 3.14 Let the assumptions and notation of Theorem 3.1S\ prevail, and suppose that 
as n —7- oo. Let Nn,i ~ ^{0,(jf{n)), i = l,...,k, Then, for every thrice 
differentiable function (/9 : M'^ — t- M, such that ||93"||oo; ||95"'||oo < co, one has that 



E[^{Fi''\...,F^''>)]-E[y,{Nn,i,...,Nn,k)]^0, n ^ oo. 
Proof According to [ 27j . the following estimate takes place: 
|F[(^(Ff\...,Fi"))]-i?[(^(iV„,i,...,iV„,fc)]| 



^ E[{afin)h=, - {DF,, -DL-^F,) ^2^^^] 



I II "'II 

+ lloo 



E 



^ k \ / k \ 

Y.\d.fA [Y^\d.l-^fA 

\i=l J \i=l / 



so that the conclusion follows from (3.16). 
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4 Edge-counting in random geometric graphs: from 
Gaussian fluctuations to clustering 



4.1 Framework 



Our aim is now to tackle Problem 1.6 Throughout this section we shall work under the following 



slightly more restrictive setting (we use the notation of Problem 1.6). 



- For every A > 0, the control measure of 7]x is nx{dx) = A x 9{dx), where ^ is a cr- finite 
non-atomic measure on (Z, 2f). 

- The symmetric function g : W x W ^ M. is bounded (this assumption can be relaxed - 
see the discussion below). 

In the forthcoming Section |4.2[ we will show that the normal approximation of the ran- 



dom variables F{g, W; rjx, Hx) (as defined in (1.2)) can be completely characterized in terms of 



some diagonal restrictions of Cartesian products of the sets Hx- Among several consequences, 
this remarkable phenomenon implicitly provides a new geometric interpretation of contraction 
operators. 

Definition 4.1 (Sets with diagonal restrictions) Given the sets H\ A > 0, d efining the 



random variables F((7, VF; -f^A) and F{g,W; rjx, Hx), 'respectively in (1.2) and (1.3), we define 
the following four sets 

- H^^^ := {{xi,X2,X3) £ : {xi,X2) G -f^A, (xi^xs) £ Hx} C Z^; 

- := {{xi,X2,X3,Xi,X5) G Z^ : {xi,Xi) G Hx, Vi = 2, ...,5} C Z^; 

- H^^^ = {{xi,X2,X3,X4) G : (xi,X2) G Hx,{x2,X3) G Hx,{x3,X4) G Hx,{x4,xi) G 
^a} C Z^; 

- H^x^ := {{xi,X2,X3,X4,X5) G Z^ : {xi,X2) G Hx,ix2,X3) G Hx,{x3,X4) G i^A, (3:4,3:5) G 
^a} c Z^. 

We shall also use the following quantities, defined for every A > 0: 
(i) 

V^i9) = Vlx{9) + Vi^xi9) 

:= 4A^ / g-k^g{xi,X2,X3)e^{dxi,dx2,dx3) + 2X'^ / g'^{xi,X2)9'^{dxi,dx2); 



(ii) Axig) = A'/y/^^snnf U!=29{xi,Xi)e^{dxi, ...,dx5); 

(iii) Bx{g) = ^ I^a^h^^) 9{xi,X2)g{x3,Xi)g{xi,Xi)g{x2,X3)e'^{dxi, ...,dxi); 

(iv) Cx{g) = ^^^^ ^J IivsnHi"-^ g'^{xi, X2)g'^{xi, X3)d^{dxi, dx2, dx3); 
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(v) Dx{g) = J^2^H^g'^{xi,X2)6'^{dxi,dx2); 

(vi) Ex{g) = ^^^^ /vK^n/ff 9{xi,x2)g{xi,X3)g{x2, X4)g{x^,x^)e^{dxi, dx^). 
The following expressions are obtained by setting 5 = 1: 



(ii 



m 



[lY) CA(l) = A3/y^3(VF3ni/«); 

(v') Dxii) = x^e^{w^nHxy, 
(vi') Ex{i) = x^/^^/e^w^^^H^). 

4.2 General conditions and bounds 

We start with a general estimate. 

Theorem 4.2 (General bound for geometric graphs) Let the previous assumptions and 
notation prevail, and let N ~ ^(0, 1). Then, there exists a universal constant C, not depending 
on X, such that, for every A > 0, 

dw{F{g,W;m,Hx),N) ^ x ma.^{Ax{g),Bx{g),Cx{g),Dx{g),Ex{g)}. (4.24) 



If the class {F{g, W; rjx, H\) : A > 0} is uniformly integrahle, then the RHS of (4.24) converges 



to zero, as X ^ oo, if and only if the CLT (1.3) takes place. 



Proof. In what follows, we write F\ = F(g,W;r]x, Hx) and Fx = F{g,W;r]x, Hx) to simplify 



the notation. Last and Penrose's formula (2.10) implies that the random variable Fx admits the 
following chaotic decomposition 

Fa = E[Fx] + /i(/i,a) + /2(/2,a) := E[Fx] + Fi,a + F2,x, (4.25) 

where /i,a(x) = 2X l{HxnWxW}{x,y)g{x,y)0{dy) and /2,A(a;i, X2) = lH^niWxW){xi,X2)g{xi,X2). 
Routine computations imply then that 

^i'A(5)=Var(F,,A), i = l,2. (4.26) 

It follows that 

Fx = h{fi/Vx{g))+l2{f2/Vx{g)). 
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The upper bound (4.24) is now obtained by using (3.17), as well as the following relations, that 
can be proved by a standard use of the Fubini Theorem: 





ll/l,A 






Bx{9) = 


ll/2,A 


A /2,A 




Cxig) = 


ll/2,A 


*2 /2,A 




Dx{g) = 


ll/2,A 






2Ex{g) = 


ll/l,A 


A /2,A 





The last assertion in the statement follows from a direct application of Theorem 3.12 



□ 



The next two statements provide simplified bounds in case one of the two elements of the 
chaotic decomposition of F{g,W;r]x, Hx) converges to zero, as A — t- oo. The proof (which is 
standard and left to the reader) uses ( |4.26 ) as well as the following basic estimate: if Q,R,S 
are three random variables in L^{P), then 

dwiQ + R,S)i^ dwiR, S) + E[\Q\]. 

Proposition 4.3 (Dominating first chaos) // 

V2,x{g) 



0, 



as A — )• oo, 



ViAg) 

then there exists a constant Ci, independent of X, such that, for A large enough, 



(4.27) 



dw{F{g,W;vx,Hx),N)^Ci 



V2Ag) 



+ 



X max{^A(5)} > ■ 



(4.28) 



Vi,x{g) ' vl,{g) 

If the class {F{g,W;rix, Hx)"^ : A > 0} zs uniformly integrahle, then the RHS o/(4.28) converges 
to zero, as \ ^ oo, if and only if the CLT (1.3) takes place. 



Proposition 4.4 (Dominating second chaos) // 

Vi,xig) 



0, as A 



oo. 



^2,A(ff) 

then there exists a constant C2, independent of X, such that, for X large enough. 



(4.29) 



dw{F{g,W;rjx,Hx),N)^C2 



+ 



1 



^l,A(g) 

V2,xig) ' Vi,{g) 



X max{Bxig),Cxig),Dx{gm } . (4.30) 



// the class {F{g, W; rjx, Hx) : A > 0} is uniformly integrahle, then the RHS of (4.30) converges 
to zero, as X ^ 00, if and only if the CLT (1.3) takes place. 
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4.3 Edge counting in stationary graphs 

For the rest of the section, we fix an integer d ^ 1. For every A > 0, we define the set 
Q\ = [— ^A"'^/'^, \ . We now specialize the framework of the previous two sections to the 
following setting 

Z = M^ W = Qi, g = l, fix = Xi, (4.31) 

where i is the Lebesgue measure on M'^. We shall assume that, for every A > 0, the symmetric 
non-diagonal set Hx has the form 

Hx = {{x,y) : X - y G Hx}, 

for some set Hx verifying 

l(HxnQi)>0. (4.32) 

Remark 4.5 We insist that the novelty here (with respect to the usual setting of disk graphs 
- see e.g. [291 Chapter 3] and the references therein) is that Hx need not be bounded, allowing 
for arbitrarily distant points to be connected. This is especially relevant whenever Hx = axHi, 
where ax is a scaling factor and Hi is a fixed unbounded geometric connection rule. Unlike 
in the classical literature of stochastic geometry, e.g. in stabilization theory, this allows for 
models with unbounded interactions, such as between distant particles. As already recalled, our 
approach is further applied in [llj , where fZ-statistics with general stationary kernels (not only 
taking values or 1), and general order k ^ 2, are considered. 

For every A > 0, we shall write 

Fx = F{l,Qi;r]x,Hx) and Fx = Fil,Qr,7]x, Hx), (4.33) 



where we used the notation introduced in (1.2)-(1.3). With this notation, each ^Fx is a sta- 
tionary [/-statistic (see Definition 2.13[ ), counting the number of edges in the stationary random 



graph based on Hx (see Definition 1.4). The chaotic decomposition of Fx is written 
Fx = E[Fx] + Fi^x + F2,x, 



where we have adopted the same notation as in ( |4.25 ). Since g = 1, Problem 1.6 becomes the 



following: characterize all collections of sets {Hx} such that the CLT (1.3) takes place, and 
assess the rate of convergence in the Wasserstein distance. 

Remark 4.6 For every A > 0, one has the equality in law 

^a'= E Wc.'A>0, (4.34) 

x,y&rir\Qx,x^y 

where 7/ is a random Poisson measure with Lebesgue intensity, and Gx is a measurable subset 
of R'^ defined by the relation 

Hx = X-'^'Gx, (4.35) 
so that 

i(Gx n Qx) > 0. (4.36) 
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Remark 4.7 (Asymptotic equivalence notation) Given two mappings A i— )• 7a, A i— )• 5\, 

we write 7a S\ if there are two positive constants C,C' > such that C7A ^ <^a ^ C'jx for A 
sufficiently large. We write 7a 5a if (^a > for A sufficiently large and a\/5\ — t- 1. 

One of the main points developed in the present section is that the asymptotic Gaussianity 
of the class {-Fa} results can be effectively studied by using the occupation coefficient of Hx^ 
defined as 

V'(A):=£(FAnQi) = ^^^^^^, A>0. (4.37) 

We also write W = W — W = and W = Qi/2d (this is the largest set such that W — W W) 
and define, for A > 0, 

V'(A):=£(FAnW^) = ^— ff^, A>0, (4.38) 
Vi(A):=£(FAnW^) = ^^%9^, A>0. (4.39) 

In order to obtain necessary and sufficient conditions for asymptotic normality, we will often 
work under the additional assumption that 

i^{X)^i^{X). (4.40) 

In this case, one has trivially that tp{X) x ^(A) x ^^id the value of is only relevant up 

to a fixed multiplicative constant. 

Remark 4.8 (O- regularity) Assume that the geometric rule defined by Gx does not depend 
on A, i.e.: Gx = G for some fixed measurable set G, i n suc h a way that each set Hx is obtained 



by rescaling G by a factor X^^^^. Then, condition (4.40) is implied by the following stronger 
assumption: 'ip(aX) x ip{X) for every a > 0. In the terminology of [3, Section 2.2], this is 
equivalent to saying that ip is 0-regular. 



In view of using the bounds appearing in Theorem 4.2, we have the following crucial esti- 
mates: 

Theorem 4.9 Let the previous notation and assumption prevail, set Vi^x{^) = ^,A; ^ = 1,2, as 



well as Ax = Aa(1), Bx = Sa(1), • • • , Ex = -Fa(I) ^ see Section 4.1 The following estimates are 
in order for every fixed A > 0; 

2-'^ij{X) ^ A-Va^A = 2X-'^Dl ^ 2'^i){X), 
2-'^tP{Xf ^ X-^Vlx = A"^C| ^ 2'^V'(A)^ 
2-^V'(A)^ ^ X-^Al < 2^i;{X)\ 
X-^Bl ^ 2^V^(A)^ 
2-'^i,{Xf ^ X-^El ^ 2'^4,{Xf. 

Proof. We introduce the changes of variables denoted by (p^^\i = 0, 1, 2, 3, 4, where 
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= {xi 


u 


= Xl 


- X2), 














= {xi 


u 


= Xl 


- X2,V 


= Xl 


- X3), 










= {xi 


u 


= Xl 


- X2,V 


= Xl 


- X3,W 


= Xl 


— X4, Z = Xl 


- X5), 


(p^^\xi,X2,X3,X4) 


= {xi 


u 


= Xl 


- X2,V 


= X2 


- X3,W 


= X3 


- X4), 




ip^^'>{xi,X2,X3,X4,X5) 


= {xi 


u 


= Xl 


- X2,V 


= X2 


- X3,W 


= X3 


X4 1 -2' ^ 


- X5). 



Using the notation introduced in Definition 4.1 we have 



n H,) = j^^ l^,_^,,nJxidX2 = K^H^dxidu^ 



i{W^ n = / l^^_^^^Hl^^_^^^H^dxidx2dx3 = / \^-jjl^^-jj^dxidudv, 

= KeH,KeH^n.eH^zeH,dxidudvdwdz, 

= £(3)(^4) K^H,K<^H,KeH,K+v+u,^H,dxidudvdw, 
l{W^r\H^^^) = / l,^_,^^Hl,^_,^^Hl,^_,^^^Hl,^_,^^H^dxidX2dX3dxdx^ 



(4) ^'^e^^ ^''^^^ ^-'^^^ r^^Ti^dxidudv. 



Using the inclusions 



and 



-'-tteHA "'"t'eHA "'"w'ei? A +w'6-ff A ^ ■'■mgHa ■'■i'6-H'a ■'■uie-H'A ' '"■''"j''^ ^ 
we have 
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f{W X (Hx n W)) = 2-^i,{X) ^ X'^Vlx ^ 1^{W X (Hx n H^)) = 2'^V;(A) 
X (Fa n Wf) = 2-'^i){\f ^ \-^Cl ^ x (Fa n Wf) = 2'^i){\f 

£^W X (Hx n H^)^) = 2-^{X)^ ^ A-^^i ^ ^^(H^ X (Hx n W^)^) = 2'^i^{X)* 

\-^Bl ^ ^^(VF X (Hx n VF)^) = 2'^V'(A)^ 
1^{W X (Fa n ty)^) = 2''^^p{\f ^ A-^^l ^ i^{W X (Fa n W^)^) = 2'^^{X)^ 



and the result follows. 



The next statement provides one of the main results of this section: it gives an exhaustive 



characterization of the asymptotic behavior of Fx, whenever (4.40) is in order. In order to allow 



for a comparison with the existing literature, we classify the asymptotic behavior of Fx according 
to four regimes, denoted by (R1)-(R4). Such a classification is based on the proportion 
'(/'(A) of space occupied by Hx in the observation window, determining the influence area 
of a given point of the Poisson measure. This coefficient has to be compared with A~^, 
which corresponds to the total window measure divided by the mean number of points. 
The four regimes are the following: 



(Rl) Xi){X) and X^/ipiX) oo; 
(R2) A^(A) ^ oo; 

(R3) (Thermodynamic regime) Xipi^) ^ 1; 

(R4) The mapping A Aa/'?/'(A) is bounded. 

The thermodynamic regime corresponds (after rescaling) to the usual models where 
the geometry of the interactions does not change as the window of observation grows to 



the whole space (see Remark 4.6). We will see in Section 4.3.1 that, when specialized to 
Poissonized disk graphs, our asymptotic approximations and variance estimates concur 
with those obtained in [251 Chapter 3]. Under regimes (R2) and (R3), there is asymp- 
totic normality with convergence at speed A~^/^ in the Wasserstein distance. Under (Rl) 
the convergence to the normal law is slower, and under (R4) the asymptotic normality 
is lost: for any converging subsequence the limit is either Poisson or zero. 



Remark 4.10 One interesting contribution of Theorem |4. 1 1 1 appears at the end of Point 
(ii), where it is stated that, under the thermodynamic regime, both chaotic projections 
of the random variable Fa contributee to the limit and satisfy a joint CLT. This kind of 
phenomenon is an example of the "fine Gaussian fluctuations" appearing in the title of 
the paper. 



Theorem 4.11 Let {Hx '■ X > 0| be a fa mily of subsets o/M satisfy ing ( 4.32) and let 
ip, 1(1 be defined according to (4.38)-(4.39|. Assume in addition that (4.40) is satisfied, 



and consider a random variable N ~ ^(0, 1). The quantities introduced in Section 4.1 
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satisfy the following relations: there exist constants < k < K < oo, independent of X, 
such that 



XipiX), T72-^A ^ A-^/^ and 



^ Kms^(y^), (A2V'(A))-^/^ AVV(A))). 



Furthermore, one can choose K in such a way that the following properties (i)-(iii) are 
verified. 

(i) (Regime (R2) ) If Xip{X) oo, the first chaos projection F^^x dominates and 

Var(Fi,;,) ~ Var(F;,) x X^^Ij{X)^ ^ oo, 

and 

dw{Fx,N)^KX-'/\ 

(ii) (Regime (R3)) If Xip{X) x 1, 

Var(FA) X Var(Fi,A) x Var(F2,A) x A, 

and 

dw(Fx,N)^KX-^/^. 

In this case one has also that, as X ^ oo, the pair 



2,A 



VVar(Fi,A)V2' Var(F2,A)V2; ' 

converges in distribution to a two-dimensional Gaussian vector {Ni,N2), such that 
Ni ~ =^(0, 1) and Ni, N2 are independent. 

(iii) (Regimes (Rl) and (R4)^ If Xip{X) — )■ 0, then the second chaos projection F2^\ 
dominates, 

Var(F;,)-Var(F2,A)xA2V^(A), 

and 

dw{Fx,N)^ ^ 



Xy/^y 



Moreover, one has that F\ converges in distribution to N if and only if X^ip{X) 00 
(or, equivalently, Var^Fx) -^00). 
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Proof of Theorem 4.11, Theorem 4.2 and assumption (4.40) yield 

r2 



V, 



2, A 

vl 



2D 



X 



X 



AV(A) 
Bl ^ 2'^AV(A)^ 



(i) We assume XtP{\) — ?■ oo. Applying (4.24), one deduces that 



Vi 



KVX 
K 

7x 



max 



{X^^iX'f, A=^/2^(A)3/2, AV^(A), X'/'^{Xy/') 



for A large enough. Notice that (4.28) does not yield a better bound because ip{X) ^ 1. 



ii) Applying again (4.24), the conclusion is deduced from Point (i), because X^jj{X) ^ c 



for some constant c > and for A large enough. The last statement at Point (ii) follows 
from an application of Proposition |3.14 



(iii) Using (4.24) again yields 

1 



dw{F,N) ^ 



A2^(A) 
K 



max 



(a5/2^(a)2, x'^ixf, a3/2^, x^pixy/'') 



for A large enough, because Xip{X) — > 0. To conclude the proof, we have to show that, 
if X'^iplX) does not diverge to infinity, then Fx does not converge in distribution to A^. 



To prove this negative result, one could apply the product formula (2.7) to prove that 



whenever X'^ip{X) is not diverging to infinity and is bounded away from zero, there exists 
a sequence A„, n ^ 1, such that A„ — )■ oo and sup„-E[F| ] < oo, so that the desired 



conclusion is deduced from the last part of Theorem 4.2 (the case when X'^iplX) is not 
bounded away from zero can be dealt with by a direct argument). However, the statement 



of the forthcoming Theorem |4.14| is much stronger, and it is therefore not necessary to 
spell out the details of these computations. ■ 



Corollary 4.12 Assume that the geometric rule defined by Gx (see (4.35)j does not 
depen d on X, in su ch a way that Gx = G for some fixed measurable set G. Assuming 
(4.40) (see Remark 4.8), one has that Fx converges in distribution to N ^ -^(0, 1), with 
a rate at most of the order A^^^^ with respect to d^. 

Proof. We are in one of the following situations: 

1. If G has finite positive Lebesgue measure, tp{X) x ip{X) x i{G)/i{Qx) x A^"*^. It 



corresponds to the case (ii) in Theorem 4.11, meaning the two chaoses codominate 



It follows that Fx converges to the normal law with a rate at most of the order of 
X^^^"^ with respect to dw- 
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2. If G does not have finite measure, 
situation of Point (i) of Theorem 
therefore deduce that 



dw{h,N)<:K\-'/\ 

for some > 0, and the conclusion follows. 



4.111 that is: 



—7- oo and we are in the 
the first chaos dominates. We 



As announced, we shall now deal more thoroughly with the case where X'^iIj{X) does 
not diverge to infinity. In the proof of the next statement we shall use the following 
notation: if X is a random variable with finite moments of every order, then we write 
: m ^ 1} to indicate the sequence of its cumulants (see [2H Chapter 3] for an 
introduction to this concept). For instance, Xi(^) = -^[^]) X2{X) = Var(X), and so on. 



Remark 4.13 The proof of Theorem 4. 14| provided below is based on diagram formulae 



and the method of moments and cumulants. An alternate proof could be deduced from the 
classic results by Silverman and Brown [36], combined with a Poissonization argument. 
Another proof of this result, complete with explicit bounds in the total variation distance, 
appears in [23]. The proof provided below has the merit of illustrating an application of 
diagram formulae (that are typically used to deduce CLTs) to a non-central result. 

Theorem 4.14 (Poisson approximations) Let the assumptions and notation of The- 
orem 4.11 prevail, and assume in addition that Xip{X) — )■ and the mapping X ^ X'^iplX) 
does not diverge to infinity, as X ^ oo. Then, there exists a sequence {A„} such that 
lim„_j.oo Xn = oo and the following properties (i)-(iii) hold. 

(i) lim^^oo Var(FA„) = lim„_!.oo Var(F2,A„) = 2c, where c is a nonnegative constant 



possibly depending on the sequence {Xn}, and we have used the notation (4.25). 

(ii) lfc = 0, then E[\FxJ] -> 0, as n ^ oo. 

(iii) If c > 0, then -F2,a„7 and therefore — E[Fx^, converges in distribution to X = 
2P{c/2), as n ^ oo, where P{c/2) indicates a centered Poisson random variable 
with parameter c/2. 

In particular, the family {Fx} does not verify a CLT as A —t- oo. 

Proof. Since Var(FA) ~ Var(F2,A) ^ A^'?/'(A), Point (i) is immediately deduced from the 
Bolzano- Weierstrass theorem. Point (ii) follows from a direct application of Campbell's 
Theorem (see [32^ Theorem 3.1.3]), yielding that, as n — )• oo, 

E[FxJ 



VaT{Fx„y/^ 



XnVH>^n) ^ 0. 



We shall prove Point (iii) by using the method of cumulants. First of all, we observe that 
since A^^/'(A„) is bounded and bounded away from zero, one has that V^^^ x A^'?/'(A„)^ — ?■ 
0, that is: as n — oo, the limits of F2,A„aiid Fx„ —E[Fx„] coincide. We recall that the law 
of the random variable X = 2P(c/2) is determined by its moments or, equivalently, by its 
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cumulants (see e.g. pp. 42-43]). Standard computations imply that = and, 

for every m ^ 2, Xm{X) = 2™~^c. We are therefore left to show that, for every m ^ 3, 

Xm{W2,xJ) (4.41) 



where /2,a is the kernel appearing in formula (4.25). Our proof of (4.41) is based on the 
use of the so-called 'diagram formulae' for multiple Poisson integrals as stated in [211 
Corollary 7.4.1] (observe that this statement only deals with simple kernels: however, 
one can immediately extend this result to our framework by using e.g. Surgailis [STf 
Theorem 3.1]). Fix m ^ 3, and introduce the following notation and definitions. We 
shall write [2m] = {1,2, ...,2m} and indicate by ttq the partition of [2m] given by ttq = 
{{1, 2}, {3, 4}, {2m — 3, 2m — 2}, {2m — 1, 2m}}. Given a partition tt of [2m], we write 
|7r| to indicate the number of blocks of vr. The function {f2,\)-Ki in |7r| variables, is obtained 
as follows: (1) consider the function 

$(Xi, ....,a;2m) = f2,\{Xl,X2) X f2,\{Xz,Xi) X ■■■ X f2,\{x2m-l,X2m), 

given by the juxtaposition of m copies of f2^\i and (2) identify two variables Xi^Xj in 
the argument of $ if and only if i and j are in the same block of vr. According to 
Corollary 7.4.1], one has therefore that 



Xn^(/2(/2,Aj) = ^n' / (^.aJ. rf^'"' 



where the symbol Aim stands for the class of those partitions tt of [2m] satisfying the 
following properties: (a) every block of vr contains at least two elements, (b) given any 
two blocks 6o £ ttq and 6i G vr, the intersection 6o H contains at most one element, and 
(c) the diagram r(7ro,7r), as defined in Section 4.1], is connected in the sense of [2^ 
p. 47]. There are exactly 2™"^^ partitions vr G Aim such that Ivrl = 2, and for any such 
partition one has that 



Alrl / (/2,Aj.rf£W = ||/2,Aj|i.((A„.)2)-^C. 

On the other hand, if vr G M.m and |7r| ^ 3, a change of variables analogous to the ones 



defined in the proof of Theorem |4.9| yields that, for some constant C independent of n, 

^ AW / U2,xXM\^\ ^ CAWV^(A„)W-i =C\li^{K) X (A„^(A„))W-2 ^ o, 

J(IRd)KI 

thus concluding the proof. 



4.3.1 Two examples 



We now present some explicit examples. The notation of Section [4^ will prevail through- 
out this section. 
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Example 4.15 (Disk graph) Assume 



for some rx > 0, meaning tliat two points of t] in Qx are connected whenever their distance 



is smaller than rx- It yields ipiX) x ^x/^ (i^ easy to verify that (4.40) is satisfied). 
Then Fx is asymptotically normal iff Xrf — )■ oo, and 

dw{Fx,N) <: C\-'/^max{l,r;'^^^). 

According to the classification based on the four regimes (R1)-(R4), the above result 
yields the following exhaustive description of the asymptotic behavior of Fx (note how 
we are able to distinguish the contribution of each chaotic projection) : 

(Rl) If ta — and Xrf — )■ oo, then Var(FA) x Xrf, Fx satisfies a CLT with an upper 
bound of the order of (Ar^)^^/^ on the Wasserstein distance, and the projection of 
Fx on the second Wiener chaos dominates in the limit. 

(R2) If — 7- oo , then Vai{Fx) x A(r^)^, Fx satisfies a CLT with an upper bound of the 
order of A^^^^ on the Wasserstein distance, and the projection of Fx on the first 
Wiener chaos dominates. 

(R3) If rx X 1, then Var(FA) x A, Fx satisfies a CLT with an upper bound of the order of 
A^^/^ on the Wasserstein distance, and the projections of Fx on the first and second 
Wiener chaos both contribute to the limit and satisfy a joint CLT. 

(R4) If Xrf converges to a finite constant c ^ 0, then Fx converges either to zero (if 
c = 0) or to a multiple of a Poisson random variable (if c > 0). 

Remark 4.16 Explicit estimates on the variances and on the rates of convergence to 
normal in the Wasserstein distance for edge counting statistics are also provided in [31| 
Theorem 6.3]. Their variance estimates are of the order 

max(A35f , A^^^) = max{X^^{Xf, X^^{X)), 

corresponding exactly to our findings {5x = X^^^'^rx = 'ip{XY^'^ in their notation). They 
obtained convergence to the normal law if 5^ — and X^^^Sf — t- oo, whereas Example 



4.15 yields a CLT whenever X'^ip{X) oo, dealing with the cases limsupA^A > and 
A"^/'^ = o((5a), Sx ^ X^'^^^'^'^\ and gives a negative answer if 6x ^ CA"^/'^ for some C > 0. 

Remark 4.17 For every fixed A, the [/-statistic ^Fx has the same law as the random 
variable counting the number of edges in a disk graph, with radius 6x = X'^^'^rx, based 
on random points of the form {Yi, ...,Ym{x)}, where {Yi} indicates a collection of i.i.d. 
random variables uniformly distributed on Qi = [— ^, and A^(A) is an independent 
Poisson random variable with parameter A. As such, each ^Fa is just a subgraph count- 
ing statistic based on a Poissonized random geometric graph, and enters the general 
framework of [251 Section 3.4], where general m-dimensional CLTs are obtained for these 
objects. It is immediately checked that our variance estimates coincide with those stated 
in [29^ p. 56] (for the case k = 2), whereas our estimates in the Wasserstein distance 



28 



refine the findings of Theorems 3.9 and 3.10] (in the case k = 2 and m = 1), where 
no information on the rate of convergence is given. Previous references for CLTs for Pois- 
sonized disk graphs are j2[[10], where no exphcit rates of convergence are provided either. 
A generahzation of the previously described findings to general subgraph counting in a 
disk graph model can be found in [TT| Section 3]. 

Example 4.18 We present here examples of geometric stationary graphs with arbitrarily 
long con nectio ns and behaviors span ning t he whole spectrum of possibilities indicated in 
In all cases, condition (4.40) is easily checked. We define G C as in Fig. 



4.11 



Theorem 

[l} symmetric with respect to all axes, and consider different choices for /, a monotone 
function i— )■ M^. We set G\ = a\G with different values for ax > 0. We refer the 
reader to Fig. |2] for results of simulations, where different values for A have been used. 

(a) If f{x) = 1/x, oa = 1, then Xipi^) ^ log(A), whence there is a CLT with an upper 
bound of the order of A~^/^ (Situation (i) of Theorem 4.11), see Fig. 



2a 



(b) If f{x) = 1/x^, a\ = 1, then Xipi^) ^ 1, whence there is a CLT with an upper bound 



of the order of A (Situation (ii) of Theorem 4.11), see Fig. 2b 



(c) If /(x) = l/a;2, ax = X'^^^, then A^(A) x A^^/^ whence there is a CLT with 



an 



upper bound of the order of A (Situation (iii) of Theorem 4.11), see Fig. 



2c 



(d) If f(x) = 1/x, ax = X then X'ijj{X) x log(A)A ^, whence there is a CLT with an 



upper bound of the order of log(A) ^ (Situation (iii) of Theorem 4.11), see Fig. 2d 



(e) f(x) = l/x'^,ax = X ^Z^. According to Theorem 4.14, Fx converges in distribution 



to a multiple of a Poisson random variable, see Fig. 2e 



In all cases, the convergence to a normal law goes hand in hand with the almost 
sure convergence of the number of connections to infinity, and with the convergence of 
the variance to infinity. In the case (d), the convergence is very slow, the number of 
connections behaves asymptotically like a Poisson law with parameter log(A), due to 
the long-range connections within the point process. In the case (e), the asymptotic 
properties of Gx do not yield long range connections and the number of connections 
converges towards a Poisson-type limit. 
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Figure 1: General form of Gx = {{x,y) : \y\ < \f{x)\ and \x\ < \f{y)\}- 



30 



Figure 2a: A = 25, A = 150 




Figure 2b: A = 25, A = 150 
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Figure 2d: A = 10, A = 50 



Figure 2e: A = 10, A = 50 
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5 Appendix: Malliavin operators on the Poisson space 

We now define some Malliavin-type operators associated with a Poisson measure rj, on the 
Borel space {Z, with non-atomic control measure /x. We follow the work by Nualart 
and Vives 



The derivative operator D. 

For every F G L'^{P), the derivative of F, DF is defined as an element of L^{P] 
that is, of the space of the jointly measurable random functions u : Q x Z \-^M. such that 
E [J^ulfi{dz)] < oo. 

Definition 5.1 1. The domain of the derivative operator D, written domD, is the set 



of all random variables F G L'^{P) admitting a chaotic decomposition (2.4) such 
that 

2. For any F G domD, the random function z i— )■ D^F is defined by 

oo 

D,F = Y,kIk-i{fk{z,-))- 



The divergence operator 5. 

Thanks to the chaotic representation property of r], every random function u G 
L'^{P, admits a unique representation of the type 



J2Wkiz,-)), zeZ, (5.42) 

where the kernel fk is a function of k+1 variables, and fk{z, ■) is an element of LKh^). The 
divergence operator 6{u) maps a random function u in its domain to an element of L'^{P). 



Definition 5.2 1. The domain of the divergence operator, denoted by domS, is the 
collection of all u G L^(P, L^(yu)) having the above chaotic expansion (5.42) satisfied 
the condition: 



J2(^+mfk\ 



L2{^(fc+l)) 



< oo. 



2. For u G dom.5, the random variable 6{u) is given by 



where fk is the canonical symmetrization of the k + 1 variables function fk . 

As made clear in the following statement, the operator 6 is indeed the adjoint operator 
ofD. 
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Lemma 5.3 (Integration by parts) For every G G domD and u G domS, one has 
that 

E\G5{u)] = E[{DG,u)m^)]. 



The proof of Lemma 5.3 is detailed e.g. in [22j. 



The Ornstein-Uhlenbeck generator L. 

Definition 5.4 1. The domain of the Ornstein-Uhlenbeck generator, denoted bydomL, 
is the collection of all F G L'^{P) whose chaotic representation verifies the condition: 

k^l 

2. The Ornstein- Uhlenbeck generator L acts on random variable F G domL as follows: 

LF = -Y,khUk). 

k^l 

The pseudo-inverse of L. 

Definition 5.5 1. The domain of the pseudo-inverse of the Ornstein-Uhlenbeck gen- 
erator, denoted by , is the space Lq{P) of centered random variables in L^{P). 

2. ForF=Z hUk) G Ll{P) , we set 



k>l 



L-'F = -Y,\Wk). 



k 

k^l 
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